Reporting on Covid-19 in Arkansas: Data Journalism
Jour 5283, Fall 2020
Remote Delivery, Monday-Wednesday 9:40 a.m.-10:55 a.m.

Rob Wells, Ph.D.

@rwells1961

Syllabus - Jour 5283: “CNTL” + click for a New Tab

Course GitHub site: “CNTL” + click for a New Tab


Inspiration:


Week #1: Course Tools, R Basics

Monday, Aug. 24

Agenda:

--Discuss syllabus
--Blackboard site
--Teams
--Intellectual Property / Data Sharing Releases
--Arkansascovid.com
--Intro R and R Studio. Open program.
--Read Machlis, Ch. 2
--Advanced learners, see below

Teams

  For the first class, please have Teams installed so we can do some exercises.    
  Teams is free through your university Office365 account. Download the Teams App through the Office365 suite.

https://its.uark.edu/communication-collaboration/office365/office365-desktop-apps.php

Teams Videos

  Microsoft Teams allows us to easily share information through the class or in discrete groups.
  
  Chat in Teams

https://www.microsoft.com/en-us/videoplayer/embed/RE4rLgJ?pid=ocpVideo5-innerdiv-oneplayer&postJsllMsg=true&maskLevel=20&market=en-us

  Create a post

https://www.microsoft.com/en-us/videoplayer/embed/RE2BIrO?pid=ocpVideo0-innerdiv-oneplayer&postJsllMsg=true&maskLevel=20&market=en-us

  How to tag a person in Teams

https://www.microsoft.com/en-us/videoplayer/embed/RWkJ9C?pid=ocpVideo0-innerdiv-oneplayer&postJsllMsg=true&maskLevel=20&market=en-us

R and R Studio

Install R and R Studio.

This is free and open source software. It is not large and doesn't tax the memory a lot. 
R runs on Windows, Mac and Linux, but this course is designed for the Mac version. 
If you use Windows, there may be variations in the lessons and instructions. Please see me      for questions.

Installing R is a two-step process: 
1) Install R, the actual program
2) Install RStudio, a common interface 

1) Download the most recent version of R for Mac:         

https://mirrors.nics.utk.edu/cran/bin/macosx/R-4.0.2.pkg

--If you have a Windows computer, go to: 

https://mirrors.nics.utk.edu/cran/bin/windows/base/R-4.0.2-win.exe

Accept all of the default settings for Mac.

 2) Install RStudio, the interface we use to manage and create R code. Download the open         source edition of R Studio desktop and follow the prompts to install it.   

https://rstudio.com/products/rstudio/download/#download

 Good instructions for installing R

http://www.machlis.com/R4Journalists/download-r-and-rstudio.html

Good overview of the program

https://docs.google.com/presentation/d/1O0eFLypJLP-PAC63Ghq2QURAnhFo6Dxc7nGt4y_l90s/edit#slide=id.p

Intellectual Property / Data Sharing Releases

UofA Rules of Conduct

https://docs.google.com/document/d/1YkdkRIzIs1WQ3P9KIICvHcppfWvwTyo2bRhGwQPsgVE/edit

License Agreement

https://docs.google.com/document/d/1AahzxDOzTf9Z6PBjBvFBOjnn9_BiM4YldnXW-mHZr9s/edit

Read Before Wednesday’s Class

    Machlis, Sharon. Practical R for Mass Communications and Journalism. Chapman & Hall/CRC The R Series. 2018. ISBN 9781138726918 https://www.amazon.com/gp/search?keywords=9781138726918
    Ch. 1, Introduction; Ch. 3, See How Much You Can Do in a Few Lines of Code
   Chapter 2: Get Started With R in a Few Easy Steps
   
   Here is the link to the firts six chapters for free
   

http://www.machlis.com/PracticalR4Journalism/index.html (first six chapters are free)

   Basic tutorial on R:
   

https://profrobwells.github.io/Guest_Lectures/Intro_To_R/R1_Intro-to-R.html


CORRECT LINK: Download the code and try it yourself

  (Left click, download, remove.txt extension)

https://github.com/profrobwells/CovidFall2020/raw/master/Exercises/R1_Intro%20to%20R.Rmd

Advanced Corner

--Import Arkansascovid.com data and summarize Washington County trends

https://docs.google.com/spreadsheets/d/17M92KbKw1nIOD_co11hN_B0o4AmAW2zs1Nb3QGeDl-I/edit?usp=sharing

Wednesday, Aug. 26

Agenda

  Arkansascovid.com
  Intro to Tableau
  Teams
  IRE Conference  IRE20 Conference will be Sept. 21-25 
  Exercises and tutorial

Same Goal, New Direction

Our partnership with Arkansascovid.com has changed.

Misty Orpin has asked the Journalism Department to run the website. https://arkansascovid.com/

I said yes.

It's an incredible opportunity for the class and the School of Journalism.

And here is how we will do it.

--The School of Journalism will have a GA and a Graduate Intern dedicated to basic daily update of the site: home page data, key charts, a few key Tweets per day.

--This class will write stories and produce data visualizations based on Arkansascovid.com data. That was our original goal. We will just be doing more and with different tools, primarily Tableau and then R, and in a different way.

And we will be Tweeting and doing podcasts, so that will be new.

Normally, I do not revise the course plan like this after the first day of class. But what an opportunity for all of us!

You and the School of Journalism and Strategic Media will be running a multimedia data site and Twitter feed with 11,000 followers reporting on one of the most important issues facing society.

What is New

  This course will emphasize Tableau for the next two months instead of R 
  We will shift to a more limited examination of R in October  
  Tableau is less complex than R   
  Homework assignments will involve Tweeting on the account   

IRE Conference: Sept. 21-25

  Fellowship: 

https://www.ire.org/events-and-training/conferences/2020-ire-conference/ire20-fellowships-scholarships

Tableau Introduction

  Please download this tutorial

https://wordpressua.uark.edu/datareporting/tableau/

Nerd Corner

  Those of you who don't need a Tableau introduction, please do this
  1) Create a new Tableau project and link to this Google Sheet

https://docs.google.com/spreadsheets/d/1g-gkjJOr1sKAu6rZHG04XA5_fM_Ma0jLr5r24fwMqiA/edit?usp=sharing You may need to copy the sheet to your own Google Drive to make it work. Let me know what you find out.

  2) Use the Austin Wilkins automated feed of ADH data to replicate any of the arkansascovid.com charts.

https://docs.google.com/spreadsheets/d/1-qxYxhtwZnGTsRbc4_WmPjROCjQZnnMusxpJN4XtVoc/edit?ts=5f4593cd&pli=1#gid=0

Read Before Next Class

Wong, Dona M. The Wall Street Journal Guide to Information Graphics. W. W. Norton & Company. 2013. ISBN 0393347281. https://www.amazon.com/Street-Journal-Guide-Information-Graphics/dp/0393347281 Ch. 1: The Basics

Arkansascovid.com

Gov. Hutchison’s YouTube Channel. Check the Aug. 24 event https://www.youtube.com/channel/UCLJcNdgp2PMEmiqJEoYzqwQ

Importing Census Data examples https://video.uark.edu/media/Census1+-+Importing+Data/1_xykm3ovj

Homework due Saturday, Aug. 29, 11:59 p.m.

This exercise will test out your Twitter skills. It is not a live Tweeting exercise. That will come next week!
Tweeting: Examine the Arkansascovid.com data for Aug. 24.   

https://docs.google.com/spreadsheets/d/1Yq2MdmMfWijmZzJwA2VulWJzOCWSiOa--SasEAGo7XY/edit#gid=1357388768

Create a Google Doc.   

Produce four draft tweets based on new cases, deaths and two other items based on Gov. Hutchison's Aug. 24 press conference, found here:

https://www.youtube.com/channel/UCLJcNdgp2PMEmiqJEoYzqwQ

Provide a Tableau image for each Tweet.   

Post the Google Doc link on the Blackboard assignment.

Week #2: Arkansascovid

Monday, Aug. 31

Agenda

Update on Daily Work Process
Who Are We?  
Review Tweet Assignment. Tableau     
Story Pitches    
Arkansascovid.com workflow    
R Basics    
Prepare Questions for Misty Orpin  
What is Wrong With This Picture    

_________________________________________________________________________________________

Random: My nephew, Nick Wells, on a 40-mile trail run Saturday on Mt Hood in Oregon

_________________________________________________________________________________________

Here’s an update

I made a lot of progress over the weekend figuring out the automation process to gather and crunch the daily numbers. I will discuss this in class but I want you all to check this short video before class tomorrow.

https://video.uark.edu/media/ADH+Data+Flow+to+Arkcovid/0_c6hn4feo

After looking at everything, I turned to R to streamline the data gathering and analysis process. So I will be teaching you guys the basic R in tandem with Tableau. They fit hand in hand.

For this class, the grading be based on a combination of daily task work to keep the site running and a few assignments that highlight the data skills I want you to learn this semester.

  • Daily Tasks

          Some combination of these each week:   
    
          See link to Daily Tasks in Teams    
          Tweeting duties: 
             Prepare and monitor twitter for research requests, feedback, engagement.    
             Draft responses. Post in Teams     
          Data Visualizations   
          Data Maintenance for Site    
          Web design, posting   
          Shorter stories    
          Improvements   
          UATV or other media partnership / appearance 
          Work with another class
          Tasks TBD 

Each week you’ll put a report of your activities in the “Daily Tasks” Assignment in Blackboard. Instructions are on that assignment.

The grading will be guided by this concept: Make meaningful contributions to the website each week. The term “meaningful contributions” is deliberately vague since we are in an evolving process. I don’t want to set a floor for minimal behavior - I am going to push you pretty hard and want you to multitask effectively. I would view the grading like this: Someone will get less than an A if they don’t communicate or flake out on an assignment. If you demonstrate to me that you are trying, we don’t have a problem.

  • Assignments

          There are still four in the semester but there will be more flexibility on when they are handed in and what the content will involve. The basic skills I want everyone to learn remain the same: 
    
          —Carefully reported enterprise stories   
          —Data Visualization in R, Tableau (or Flourish)   
          -R Data Analysis with a basic chart   
          —Major Data Maintenance, Restructuring for Site      

The R and Tableau instructional material will be delivered like before. I will count on you guys to work through the exercises and gain basic competency.

Teams Has Two Standing Documents Posted for Daily Tasks and Story Pitches, which I will discuss below

  • Who Are We?

    I have done a few media interviews since the announcement Thursday. First, the announcement was met by a great deal of enthusiasm in the community, among senior faculty and the like.
    A few questions came up.

    –Will we have the opinions that Misty offered in her Twitter feed? My response is Misty came to her opinions after some very in-depth reporting. My class is oriented towards the Associated Press style of reporting, hard news and analysis. Our posts will be rooted in-depth reporting that is transparent to the readers.

    –How do we respond to people who ask us Covid-19 questions? If you have the answer and can supply the source, then reply, noting you are speaking for yourself. If you are in doubt, say you don’t know and that you’ll raise the issue for the class.

    This should serve as a reminder that everyone in this class will be in a social media fishbowl. People may troll us and look in our social media feeds to find things to embarass the project. Keeping your personal social media feeds professional will be important. This is one thing all journalists need to acknowledge.

  • Tasks To Do

    See document in Teams Suggest a task here. put a comment on it and tag me. If I agree, I’ll add it

  • Story Pitches

    See document in Teams. If you have a story idea, even if it is not fully developed, put it here and the class can examine and discuss it.

_________________________________________________________________________________________

Exercise

Arkansascovid.com Tableau Lesson #2 –Notes are in Teams

https://wordpressua.uark.edu/datareporting/tableau/

Basic R exercise

–Left click on the link below, remove .txt extension, save as all files

https://github.com/profrobwells/CovidFall2020/blob/master/Exercises/Intro%20to%20R%208-20-2020.Rmd

Second option: If that isn’t successful, then go to the https://raw.githubusercontent.com/profrobwells/CovidFall2020/master/Exercises/Intro%20to%20R%208-20-2020.Rmd –Left Click on document –Save as ALL FILES. REMOVE .TXT extension

Third option: Download "Intro to R 8_20_2020.rmd from your browser directly. https://github.com/profrobwells/CovidFall2020/tree/master/Exercises

Ch 1 & 2 of Machlis: Key Points

Reproducible research 
Repetitive tasks in modern newsrooms.  Employment reports, crime stats, budgets
Variables - an R object
Assignment operator <-
Case sensitive
Vector: A vector can only have one type of data - all integers, all strings
Dataframe - like a spreadsheet
Save files - Don’t save workspace: because all of your variables will be stored and re-loaded the next time you launch RStudio. It’s too     easy to forget about previously stored variables that can interfere with later work,

Software packages: tidyverse, rio, pacman

Data Types and R

Machlis: 2.4.2 Data types you’re likely to use often

_________________________________________________________________________________________

Homework due Tuesday, Sept. 1, 11:59 p.m.

  1. Three specific questions for Misty Orpin, our guest speaker Wednesday. Ask about the site’s workflow, her tone and voice on Twitter, her interactions with the public on social media, her concerns about the state data, or something else of significance. Post your questions in Teams

  2. What Is Wrong With This Picture?
    Examine this graphic and the data.

Consider the news reporting on Covid-19 trends in Arkansas and issues with data. Tell me what you think are the issues with this graphic. Post your comment on Teams. One paragraph or so.

Read Before Next Class

Cohen, "Numbers in the Newsroom," Common Mistakes.   
Machlis. Ch 4: Import Data into R    
Basics of Data Analysis (On Blackboard)   

https://learn.uark.edu/webapps/blackboard/content/listContent.jsp?course_id=_276555_1&content_id=_8815471_1

_________________________________________________________________________________________

Wednesday, Sept. 2

Agenda

Guest speaker: Misty Orpin, Arkansascovid.com

Read Before Next Class

Homework due Saturday, Sept. 5, 11:59 p.m.

For Kendal, Obed, Quincy: Tableau Skills Part 2, Sept. 2, 2020 Work on this exercise on calculated fields. Make the assigned graphcs, insert into a Google Doc. Put that in your weekly memo due Saturday.
https://wordpressua.uark.edu/datareporting/tableau/

For Mary and Abby:

  Look in the Machlis book about how to navigate R Markdown files.
  Load this tutorial in your R Studio and run it. 
  Contact me with questions.
  Introduction to R

https://raw.githubusercontent.com/profrobwells/CovidFall2020/master/Exercises/Intro%20to%20R%208-20-2020.Rmd

  --Left Click on document
  --Save as ALL FILES. REMOVE .TXT extension
  

For Katy:

  Continue testing Arkansascovid scripts
  Work with Wells on GitHub account

Read Before Next Class

 Machlis, Ch. 6: Beginning data visualization
 Video: Basic Charts in R  

https://www.youtube.com/watch?v=1EUJ0tsVsUA&t=12s

Refresher: Numbers in the Newsroom (On Blackboard)

https://learn.uark.edu/webapps/blackboard/content/listContent.jsp?course_id=_276555_1&content_id=_8815471_1


Week #3: Data Types. Loading Data

Monday, Sept. 7.

Happy Labor Day! No Class

Wednesday, Sept. 9

Agenda

Weekly memos
Story pitches. Due Saturday with weekly memo
Review Task List on Teams. Tweets
Editing the Tweets. Have the Image Match the Narrative
Volunteer editor W-T-F (pardon the pun)
Audience Engagement. Read the Feed! Twitter Response Hour.
Continue with R tutorial
Exercise: Loading Data from U.S. Census & Student Loans
top_n function makes life easy  
Sorting

Weekly memos

Please supply links to documents and data you worked on.
The memos will be my resource for finding stuff. Help me find things

Story Pitch for Assignment #1. Due With Weekly Memo, Saturday

  Story Pitch for this assignment. 200 Words on your idea, list 3 sources and the data you plan to use.

  Assignment #1: (Due Sept. 21): Managing Data / Static Graphic
  Static Graphic - Managing Data in R / Tableau.        
  Students will use R Studio to gather, analyze and visualize Arkansascovid data and report and write a 600 word story. 

Notes

 --The pie chart focuses the reader on large percentages, and encourages the reader to think of the total
--The stacked bar plot provides the same information, but makes it easier to accurately determine at a glance how large each group is out of the whole.
--This bar chart splits the categories horizontally, and draws attention to how the family members are ordered. It encourages the reader to think about the distribution rather than disconnected categories, and gives a better sense of sense of scale.     
  

Read Before Next Class

Wong, Dona M. The Wall Street Journal Guide to Information Graphics. 
Ch. 2: Chart Smart

Charts_with_ggplot by Andrew Ba Tran

Grammar of Graphics

http://vita.had.co.nz/papers/layered-grammar.html


Week #4: Visualizatiion

Monday, Sept. 14

Agenda

Teams Housecleaning: Master Work Document
Review Task List
Data - R quick viz on publication
GitHub data
Twitter: Voice, engagement, training wheels: sign up
Twitter schedule this week
White House Report
Data Visualization
ggplot2 - charts and maps
Export Static chart 
Discuss Ch 5, Machlis: See Notes

-GitHub Site for Arkansascovid https://github.com/Arkansascovid/Main

-White House coronavirus task force report on Arkansas Places the state in the “red zone” for cases. If someone wants to write a quick piece or do a Tweet based on this, excellent. https://www.documentcloud.org/documents/7204533-Arkansas-9-6-20.html The document was obtained by the Center for Public Integrity, an investigative journalism outlet in DC. https://publicintegrity.org/health/coronavirus-and-inequality/white-house-coronavirus-red-zone-reports-covid/

-Discuss Downloading Data 9-9-2020.Rmd

https://raw.githubusercontent.com/profrobwells/CovidFall2020/master/Exercises/Downloading%20Data%209-9-2020.Rmd

-Discuss

Wong, Dona M. The Wall Street Journal Guide to Information Graphics. Ch. 2: Chart Smart

Exercise

  Tutorial: Graphing Introducton Jan 11 2020.R
  • GGPLOT

    A handy explanation of ggplot and its components

If you’re using ggplot: plus it!
For everything else: pipe it!

aes - reorder in ggplot 
 
geom_point() 
geom_bar()
geom_boxplot() 
  • Data Visualization Intro

    Load tutorial: Basic Data Visualization 12-26-18.R

Download this file and open it in R Studio

Post in Teams Before Next Class

-Read our reader feedback on the new site. Draft a response to this person. We’ll discuss and then send one See “Stacy Robinson Your Post on Twitter” in Files | Teams

https://teams.microsoft.com/l/file/0C1260C1-91AA-433E-9332-662CD6AF7C3F?tenantId=79c742c4-e61c-4fa5-be89-a3cb566a80d1&fileType=pdf&objectUrl=https%3A%2F%2Fuark.sharepoint.com%2Fteams%2FDataJournalismFall2020%2FShared%20Documents%2FGeneral%2FStacy%20Robinson%20Your%20post%20on%20Twitter%209-10-20.pdf&baseUrl=https%3A%2F%2Fuark.sharepoint.com%2Fteams%2FDataJournalismFall2020&serviceName=teams&threadId=19:a669382a31534ac8a1b588012f22be44@thread.tacv2&groupId=8c19e8e1-4bf3-4f65-9afa-6104d37b5bd2

-IRE Conference Schedule

Identify two events you plan to attend and tell the class why

https://www.ire.org/events-and-training/event/4125

Read Before Next Class

Wong, Dona M. The Wall Street Journal Guide to Information Graphics. 
Ch. 3 & 4: Ready Reference and Tricky Situations

Prof. Alberto Cairo on Data Visualization    

https://video.uark.edu/media/Alberto+Cairo+Data+Viz/0_10pblm1b


Wednesday, Sept. 16

Agenda

Data Visualization
ggplot2 - charts and maps
IRE Conference Schedule - what are you attending



> [**GGplot Video from Andrew Ba Tran**](https://www.youtube.com/watch?v=Sx7d7eGRSj0&t=9s){target="_blank"} 

  • Exercise

    Graphing in GGPlot
    https://bit.ly/2Gqjfj4

    Multiple variable in a graph Geom_Line, Geom_point, Geom_bar How to alter the colors in a chart.

Quiz - Math and R

TK TK TK Test Canvas: Excel Quiz

Read Before Next Class

Wong, Dona M. The Wall Street Journal Guide to Information Graphics. 
Ch. 5: Charting Your Course

Albert Cairo, "The Functional Art," Principles of Data Visualization.

http://learn.r-journalism.com/en/wrangling/dplyr/dplyr/


Week #5: Dplyr Bootcamp

Monday, Sept. 21

Agenda

Review Assignment #1   
Assignment1_KEY_StaticGraphic_2_9.R    
Dplyr bootcamp. 
IRE Conference

Assignment #1

  Assignment #1: (Due Sept. 21): Managing Data / Static Graphic
  Static Graphic - Managing Data in R / Tableau.        
  Students will use R Studio to gather, analyze and visualize Arkansascovid data for Arkansas and report and write a 600 word story. 
  
  • **Dplyr Presentation*

    Five basic verbs filter() select() arrange() mutate() summarize() plus group_by()

  • Pipes - a Much-Used Command to Link Filters, Functions

    pipe %>% CMD + Shift + M

    Pipes are a way of chaining commands.
    object %>% operation() —> result

    Presentation from Bob Rudis on Writing Readable Code with Pipes, delivered at the rstudio::conf 2017.
    https://www.rstudio.com/resources/videos/writing-readable-code-with-pipes/

  • Key Concepts - Moving Forward:

    Dplyr: Filters, Grouping, Sorting, pipes %>%
    Pipe shortcut = CMD + SHIFT + M Basic data visualization
    Tidyverse

  • Exercise:

    DPLYR BOOT CAMP 5th Lesson 8-21-2020.Rmd

    Make a Dplyr Cheat Sheet, Hand in on Blackboard by 11:59 pm Monday.

  • You are in Dplyr bootcamp!

-Date Corrected Data https://video.uark.edu/media/Date+Corrected+Data/1_2rmzgzwu

Read Before Next Class

Machlis: Ch. 8 Analyze data by groups
Transforming and Analyzing Data dplyr.pdf, Andrew Ba Tran, Washington Post
Dplyr - Andrew Ba  Tran - pipes-dplyr.pdf

Wednesday, Sept. 23

Agenda

IRE Conference
Dplyr boot camp!!!
There is nothing else. Focus!!!
  • DPLYR

    DPLYR BOOT CAMP 5th Lesson 8-21-2020.Rmd

Code in Fun

Code in Fun

Notes: How Do I?

https://smach.github.io/R4JournalismBook/HowDoI.html

Read Before Next Class

Machlis 
Ch. 13 Date calculations
Dealing-with-dates.pdf by Andrew Ba Tran  

https://github.com/profrobwells/Data-Analysis-Class-Jour-405v-5003/blob/master/Readings/dealing-with-dates.pdf


Week #6: Dealing With Dates

Monday Sept. 28

Agenda

Lubridate
Review Ch. 13 Machlis
Review Tran and Lubridate

Exercises

–Using Lubridate

  The exercise: **TO BE UPDATED**   
  Lubridate_Intro_Feb_20.R

https://bit.ly/2H07YpX Lubridate vignette
browseVignettes(“lubridate”)

  What we will produce    

Read Before Next Class

Machlis
Ch. 9 Graphing by Group
Ch. 11 Maps in R

Wednesday Sept. 30

Agenda

Lubridate

Exercises

**TO BE UPDATED**
Key to Lubridate Questions in Exercise:    

https://bit.ly/2BO92d1

Read Before Next Class

Machlis
Ch. 11 Maps in R

Visual Narrative Tricks by Albert Cairo

https://www.youtube.com/watch?v=TSGaueL4Ggk


Week #7: Maps in R

Monday Oct. 5

Agenda

  Mapping
  

Exercises

      Mapping Demo: Map Demo 6th Lesson 8-22-2020.Rmd
      Mapping Exercise from Machlis book, Ch. 11      

https://bit.ly/2VXCSU2

Read Before Next Class

Joining Dataframes in R  

https://www.youtube.com/watch?v=gLg4D9bMIyc&t=13s


Wednesday Oct. 7

Agenda

  Maps
  Andrew Ba Tran - Mapping

https://nicar.r-journalism.com/docs/ http://learn.r-journalism.com/en/mapping/
Data Cleaning
Disaggregating variables for summation

Read Before Next Class

Ch. 7 Two or more data sets 

Data Wrangling

http://learn.r-journalism.com/en/wrangling/


Week #8: Maps in R

Monday, Oct. 12

Agenda

  Video of Machlis mapping  

https://www.youtube.com/watch?v=HFJOV5XaU_U See R script: Maps in R March 24 2019 https://bit.ly/2FvZDrB

Assignment 2. Graphic with Multiple Data Sources. Due Oct 12

Students will use R Studio to gather, analyze and visualize Arkansascovid data by demographic for       Arkansas using Census data or school district data. Results will be posted on GitHub. 600 Word          story. Data dictionary required
Students will produce publication-ready graphics from data. 
With the assigned dataset, XXXX.csv, students will produce the following tables:
Most common words used in data text field
Based on this information, write a 600 word story, following AP style, that describes potential         newsworthy trends. 
By 11:59 pm., you will submit the following on Blackboard:  
XXX charts in .jpeg format: 1) Common words 2) Common hashtags 3) Date trend chart
One Google Doc with your findings, 600 words. Append at the end a brief data dictionary.
An R script with the coding that shows how you loaded and cleaned the data and produced the charts

Read Before Next Class

Machlis: Ch. 14 Integrate R With Your Storytelling Using R Markdown

Wednesday Oct. 14: Mapping

Agenda

  Adapt Machlis - Maps in R Ch 11 exercise for Median Income in Arkansas
  Maps in R March 24 2019 - Using the Census API

https://bit.ly/2FvZDrB

  Building a Census tract map
  

We will build this

TK TK Images/Income by Census Tract Washington Co.png)    

Exercises

Map Demo: Interactive Map.R
Adapt Machlis - Maps in R Ch 11 exercise for Median Income in Arkansas
![You will make this in the class]TK TK Images/ARmed_income3-22-19.png

Read Before Next Class

Machlis Ch. 15 Simple Web scraping
APIs - basics   

https://medium.com/@LewisMenelaws/a-beginners-guide-to-web-apis-and-how-they-will-help-you-23923a0da450


Week #9: APIs

Monday Oct. 19

Agenda

  Sign up for a census key:

https://api.census.gov/data/key_signup.html What is an API?

Read Before Next Class

Machlis Ch. 12 Putting it all Together: R on Election Day

A gentle introduction to APIs for data journalists:   

https://trendct.org/2016/12/29/fetching-airport-delays-with-python-a-gentle-guide-to-apis-for-journalists/

tran mapping-census-data.html

Wednesday, Oct. 21 APIs

Agenda

APIs
Census API Exercise
tran mapping-census-data.rmd

Read Before Next Class

Machlis Chs. 17 & 18
Census Reporter to look up tables

https://censusreporter.org/


Week #10: APIs

Monday Oct 26

Agenda

  Twitter API
  Twitter Scraping

Exercises

Twitter analysis of Trump Tweets 

http://varianceexplained.org/r/trump-tweets/

Twitter historical API

https://developer.twitter.com/en/docs/tutorials/choosing-historical-api

Read Before Next Class

 Max Harlow Presentation on How to Use GitHub

https://docs.google.com/presentation/d/1MbltRcOerktc-E26HMDjYj0BO9CTubQWu1Z2bB9CpVY/edit#slide=id.g448ccc227721fe56_10

“Connecting the Dots” by Jacob Harris (2015) and discuss how people should or should not be represented through news visualizations.

Wednesday Oct 28: GitHub

Agenda

    GitHub Exercises
    

Resources:

This class is intended to teach you modern workflow techniques for coding. A centerpiece of that workflow is GitHub. This is a website with a system that allows you to collaborate with other programmers on coding projects. It manages versions of software code and is a very popular with the tech elite. 

Your GitHub account, which is public, represents an important professional image. Prospective employers and collaborators will look at your GitHub account.

Create a GitHub account.

https://github.com/

Simplified GitHub- GitHub Desktop

https://help.github.com/en/desktop

Exercises

See Basic GitHub 4-22-19.R

https://bit.ly/2UAMGTd

Installing Git for a Mac - Andrew Ba Tran

Read Before Next Class

--Follow this tutorial

https://guides.github.com/activities/hello-world/


Week #11: GitHub, Data Cleaning

Monday, Nov. 2

Agenda

  Using GitHub
  Set up R Project With GitHub
  Setting up an R Workflow

http://learn.r-journalism.com/en/publishing/workflow/r-projects/

Resources on GitHub

  GitHub flow   

https://guides.github.com/introduction/flow/

  GitHub Guides  

https://guides.github.com/

  Another GitHub guide  

https://andrewbtran.github.io/NICAR/2018/workflow/docs/03-integrating_github.html

Assignment 3: Interactive Map. Due Nov 2

  Students will use R Studio to build interactive maps of Arkansascovid occupational data in        Arkansas. Results will be posted on GitHub. Data dictionary required

Exercises

Extracting Text Strings from data

https://bit.ly/2X5V9jL

Joins in R: 

https://bit.ly/2OFnGJ6

Read Before Next Class

  TK 

Wednesday, Nov. 4

TAKE A DEEP BREATH, AMERICA

Agenda

Joins in R
Data Cleaning and Text Mining
 

Read Before Next Class

Tidy text mining

https://www.tidytextmining.com/tidytext.html#

Bad data visualizations. Data Translation. 

The Journalist as Programmer: A Case Study of The New York Times Interactive News Technology            Department 

http://isoj.org/wp-content/uploads/2016/10/ISOJ_Journal_V2_N1_2012_Spring.pdf

What is code?

http://www.bloomberg.com/graphics/2015-paul-ford-what-is-code/


Week #12: Data Cleaning

Monday, Nov. 9

Agenda

Data Cleaning and Normalization
Review Assignment #3

Exercises

Splitting Hashtags 2-25-19.R   

https://bit.ly/2BQIE2i

R Markdown to distribute findings to Stanford, Feb 2020

https://profrobwells.github.io/HomelessSP2020/SF_311_Calls_UofA.html

Read Before Next Class

Julia Angwin, Terry Parris Jr., Surya Mattu. “Breaking the Black Box: What Facebook Knows About         You,” ProPublica, 2016; 

Nicholas Diakopolous, “Algorithmic Accountability,” Digital Journalism, 2014.

Wednesday, Nov. 11

Agenda

Data Cleaning and Normalization
Text Mining

Exercises

  Kavanaugh text mining story: Text analysis of Brett Kavanaugh’s opinion.  

http://www.storybench.org/bringing-textual-analysis-tools-to-judge-brett-kavanaughs-latest-opinion/

Read Before Next Class

  Text Mining with R

https://www.tidytextmining.com/ Ch. 1 The Tidy Text Format Ch. 2 Sentiment Analysis


Week #13: Text Mining

Monday, Nov. 16

Agenda

Text Mining Exercises
Coulter-AOC Twitter Feeds

Exercise

AOC-Coulter Data Mining Exercise: 
Coulter Tweet Analysis #2 Exercise

https://bit.ly/2TdY9Mv Key to the exercise:
https://bit.ly/2F3brlh

Read Before Next Class

  Text Mining with R

https://www.tidytextmining.com/ Ch 3 Analyzing word and document frequency: tf-idf


Wednesday, Nov. 18

Agenda

  Continue With Text Mining
  Turn your R cheatsheet into a PDF
  Turn your R cheatsheet into a web page on GitHub

Exercise

  Sentiment of Gov. Hutchison's Twitter Feed
  Bigrams: 

http://bit.ly/bigramz

Coulter Bigrams - Score
TK TK Images/Coulter bigram score.jpeg)

Week #14: Web scraping

Monday, Nov. 23

Agenda

Simple Web Scraping

Exercises

Web Scraping in R: Simple Web Scraper 

http://bit.ly/scrapeme

Read Before Next Class


Wednesday, Nov. 25

Agenda

OFF - THANKSGIVING

Week #15: Wrap Up

Monday, Nov. 30

Agenda

Flexible schedule. Reporting Updates

Assignment 4. Interactive Data Visualization. Due Nov 30.

Students will use R Studio to build interactive graphics / maps of Arkansascovid data by school district. 600 Word Story. Results will be posted on GitHub. Data dictionary required

Exercises

Splitting Hashtags 2-25-19.R   

https://bit.ly/2BQIE2i


Wednesday, Dec. 2

Agenda

 Flexible schedule. Reporting Updates

Week #16: Wrap Up

Monday, Dec. 7

Agenda

Flexible schedule. Reporting Updates

Important!

Course Evaluation
Please do me  a favor and evaluate this course. 
It's important to me and the department to get your thoughts
on what worked and what did not. 
If you think it is important, 
then please take five minutes to fill out the survey.

https://courseval.uark.edu/

Wednesday, Dec. 8

Agenda

  Zoom Party

Congratulations!

 We covered a lot this semester

Twitter Exercises

    Coulter Tweet Analysis #2 Exercise
    https://bit.ly/2TdY9Mv
    
    Twitter exploration exercise
    https://bit.ly/2Sqn1j1
    
    Return to Twitter Engagement
    http://bit.ly/2GParD5
    
    AOC Twitter feed   
    https://bit.ly/2Sqn1j1  
    
    Discuss Twitter Metadata     
    https://developer.twitter.com/en/docs/tweets/data-dictionary/overview/user-object   
    
    Study Twitter meta data  
    https://developer.twitter.com/en/docs/tweets/data-dictionary/overview/tweet-object.html
    Look at this example: Ocasio.csv
    
    Twitter analysis of Trump Tweets
    http://varianceexplained.org/r/trump-tweets/
    
    Show Collins results  
  • Bots

      Bot or Not: Difficulty determining a bot on Twitter    
    
      An app that uses machine learning to guess if a Twitter account is a bot   
      https://www.r-bloggers.com/botrnot-an-r-app-to-detect-twitter-bots/    
      https://mikewk.shinyapps.io/botornot/    
    
      Article about Botometer   
      https://www.vox.com/technology/2018/4/9/17214720/pew-study-bots-generate-two-thirds-of-twitter-links   
    
      Stanford research paper on this topic    
      https://pdfs.semanticscholar.org/e219/6b47133c2191d380098744c13ba77133e625.pdf   

–30–